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ABSTRACT Self-incompatibility (SI) is a genetic system that prevents self-fertilization in many Angiosperms. KEYWORDS 

Although plants from the Brassicaceae family present an apparently unique SI system that is ancestral to the self- 
family, investigations at the S-locus responsible for SI have been mostly limited to two distinct lineages incompatibility 

(Brassica and Arabidopsis-Capsella, respectively). Here, we investigated SI in a third deep-branching lineage Biscutelleae 

of Brassicaceae: the tribe Biscutelleae. By coupling sequencing of the SI gene responsible for pollen recog- SRK 

nition (SRK) with phenotypic analyses based on controlled pollinations, we identified 20 SRK-like sequences controlled 
functionally linked to 13 S-haplotypes in 21 individuals of Biscutella neusthaca and 220 seedlings. We found crosses 

two genetic and phylogenetic features of SI in Biscutelleae that depart from patterns observed in the refer- dominance/ 
ence Arabidopsis clade: (1) SR/C-like sequences cluster into two main phylogenetic lineages interspersed recessivity 

within the many SRK lineages of Arabidopsis; and (2) some SR/C-like sequences are transmitted by linked genetics of sex 
pairs, suggesting local duplication within the S-locus. Strikingly, these features also were observed in the 
Brassica clade but probably evolved independently, as the two main SRK clusters in Biscutella are distinct from 
those in Brassica. In the light of our results and of what has been previously observed in other Brassicaceae, we 
discuss the ecological and evolutionary implications on SI plant populations of the high diversity and the 
complex dominance relationships we found at the S-locus in Biscutelleae. 



Self-incompatibility (SI) is a genetic system that prevents self-fer- 
tilization in plants (reviewed in Charlesworth 1987) and is present 
in approximately 40% of Angiosperm species (reviewed in Igic 
et al 2008). SI is generally controlled by a single locus (S-locus) 
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at which tightly linked genes forming coadapted haplotypic com- 
binations encode pollen and pistil specificities. Pollen is rejected 
after pollination if its specificity is encoded by the same haplo- 
type as that of the pistil. The determination of pollen phenotype 
occurs in two distinct flavors in SI systems. In gametophytic 
systems, the pollen phenotype is denned by its haploid genotype 
(reviewed in Franklin-Tong and Franklin 2003), whereas in spo- 
rophytic systems (SSIs), pollen phenotypes are controlled by the 
diploid genotype of the paternal plant (reviewed in Hiscock and 
Tabah 2003). 

The molecular mechanism of pollen rejection in SSI has been 
described only in Brassicaceae and involves the interaction of 
a cysteine-rich protein (SCR) deposited on the pollen surface 
(Schopfer et al 1999) with a transmembrane receptor kinase 
(SRK) of the stigma (Takasaki et al 2000). Knowledge about the genes 
involved in SI has allowed comparative investigations of molecular 
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diversity of both genes in different groups of Brassicaceae. 
Functional alleles of SRK have been found in several genera of 
Brassicaceae, including Brassica (Kusaba et al. 1997; Sato et al. 
2002), Raphanus (Lim et al. 2002), Capsella (Guo et al. 2009; Paetsch 
et al 2006), Arabidopsis (Castric and Vekemans 2007; Kusaba et al 
2001; Schierup et al 2001), Arabis (Tedder et al 2011), as well as 
functional alleles of a SRK ortholog in Leavenworthia (Busch et al 
2008), suggesting that the SCR/SRK SSI system is ancestral in this 
family. SRK alleles in these genera share the property of trans- 
specific or even trans-generic polymorphisms, as expected due to 
the strong negative frequency-dependent selection occurring at the 
S-locus (Schierup et al 1998). However, phylogenetic relationships 
among alleles differ in different groups. SI species of Arabidopsis 
and Capsella have SRK alleles distributed into many lineages, with 
highly diverged sequences, whereas SRK alleles in Brassica and Raphanus, 
taken together, cluster into only two sequence clades, called classes I 
and II, evolutionary distinct since they are intermingled with SRK 
lineages from Arabidopsis and Capsella (Figure 1 and Supporting 
information, Table SI) (Castric and Vekemans 2007; Edh et al 2009; 
Schierup et al 2001). In the genus Leavenworthia, the S alleles 
clustered into a single clade that is highly divergent from SRK alleles 
from the other genera (Busch et al 2008). This seems to be due to 
independent evolution of the pollen and pistil genes involved in SI, 
from different members of the same gene families as SCR and SRK 
(Chantha et al 2013). 

In SI species of Arabidopsis, only two functional genes, SRK and 
SCRy have been found within the S-locus (Goubet et al 2012; Guo 
et al 2011), whereas most Brassica haplotypes have a third gene 



(SLG), which is a paralog of SRK lacking its transmembrane and 
kinase domains (Nishio and Kusaba 2000; Takasaki et al 2000). Al- 
though SLG is not required for the SI reaction (Nishio and Kusaba 
2000), the tightly intermingled phylogeny of SRK and SLG alleles 
observed in Brassica suggests frequent gene conversion events between 
these two genes (Sato et al 2002; Takuno et al 2008). 

Another distinctive feature of SSI systems is the occurrence of 
dominance relationships between S-haplotypes in heterozygous genotypes. 
SI Brassica species have two main dominance classes of S-haplotypes, 
which correspond to the two phylogenetic clusters of haplotypes 
(Nasrallah et al 1991). In Arabidopsis, dominance relationships 
appear to be more complex with more than two dominance levels, 
showing only partial congruence with phylogenetic clustering of 
alleles (Llaurens et al 2008; Prigoda et al 2005). Dominance 
relationships are expected to affect S-haplotypes frequencies be- 
cause the negative frequency-dependent selection typical for SI 
systems (Wright 1939) will be more intense on dominant than on 
recessive haplotypes (reviewed in Billiard et al 2007), resulting in 
greater frequencies of recessive haplotypes in natural populations 
(Schierup et al 1997). This prediction has been validated empir- 
ically in Arabidopsis halleri (Llaurens et al 2008), A. lyrata 
(Mable et al 2003; Schierup et al 2006), and Brassica insularis 
(Glemin et al 2005). 

Although SI is well described and documented in Brassicaceae, 
there is still little clue about the evolutionary mechanisms responsible 
for the differences between species just outlined, e.g., regarding the 
patterns of phylogenetic clustering of alleles, the co-occurrence of 
a SRK paralog at the S-locus, and the patterns of dominance 




Figure 1 Phylogenetic relationships between the 29 SR/C-related sequences found in Biscutella neustriaca and sequences from other Brassi- 
caceae. The evolutionary trees were built using Neighbor-Joining with a Poisson corrected distance based on amino acid sequences. The 
proportion of replicate trees in which branches clustered together is indicated close to nodes (1000 bootstrap replicates). (A) The 29 SR/C-related 
sequences found in B. neustriaca cluster in three monophyletic groups similar in sequence to SRK {BneSRKL_A, B, and Q and paralogs ARK3 and 
Aiy327420 found in other other Brassicaceae. Aha, Arabidopsis halleri; Aly, A. lyrata; Ath, A. thaliana; Bol, Brassica oleracera; Bra, B. rapa; Bne, 
Biscutella neustriaca). (B) Phylogenetic relationships among SRK and SRK-like {SRKL) sequences in four species of Brassicaceae: Capsella gran- 
diflora (black); Brassica oleracera (gray), Arabidopsis lyrata (white), and B. neustriaca (dark gray). The three clades in Biscutella (continuous line) are 
distinct to the two clades of Brassica (dotted line). Accession numbers of SRK-related sequences are detailed in Table S1. 
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relationships. Because most studies focused on two divergent model 
groups (Brassica-Raphanus and Arabidopsis-Capsella, respectively) 
we investigates SSI in Biscutella neustriaca, a self- incompatible mem- 
ber of the distantly related Biscutelleae tribe (Franzke et al 2011). In 
a previous study, we denned eight phenotypic incompatibility groups 
by using a full diallele crossing design between 21 individuals of B. 
neustriaca, which implies that the species has at least eight functional S- 
haplotypes (Leducq et al 2010). 

We used molecular approaches to identify S&fC-like sequences 
in B. neustriaca individuals, together with segregation analyses, con- 
trolled pollinations and paternity assignment from open pollinations 
to relate the sequences to SI phenotypes and to investigate dominance 
among haplotypes. Two striking features of the SRfC-like sequences 
were observed: (1) most haplotypes appear to carry two linked SRK- 
like sequences; and (2) all but one of the S-RiC-like sequences grouped 
into distinct phylogenetic clusters whose sequences differ from Bras- 
sica ones. 

MATERIALS AND METHODS 
Identification of SRK-like sequences 

We used 21 B. neustriaca individuals (collection F0) from a collection of 
plants maintained in the Conservatoire Botanique National de Bailleul 
(France) that we previously studied by controlled pollinations to iden- 
tify incompatibility groups (Leducq et al 2010). We used these plants to 
identify putative SRK sequences and validate their SI function by in- 
vestigating association with the incompatibility groups. 

DNA was extracted from leaf material as described in Leducq et al 
(2010). Candidate SRK sequences were initially obtained by polymer- 
ase chain reaction (PCR) amplification with primers 13SEQ2 and 
SLGR as described in Schierup et al (2001). The PCR mixture 
(15 uX) contained 20 ng of DNA, IX buffer (Applied Biosystems, 
Foster City, CA), 2.5mM MgCl 2 , 400 uM Fermentas dNTP mix 
(Fermentas Canada, Burlington, ON, Canada), 150 u,g/mL BSA, 
0.5 mM each primer, and 0.05 U/uX Taq polymerase (Amplitaq 
DNA polymerase; Applied Biosystems). Amplifications were per- 
formed on a Mastercycler EpGradient Eppendorf thermocycler 
with the following conditions: 15 min at 95°, 40 cycles of three 
steps: (1) 45 sec at 95°, (2) 1 min at 50° and (3) 1 min at 72°, 
followed by 10 min at 72°. PCR products were purified using 
NucleoSpin Extract II kit from Macherey-Nagel and ligated and 
transformed into chemically competent bacteria as described in Cas- 
tric and Vekemans (2007). At least eight clones per individual were 
sequenced with the BigDye3.1 sequencing kit (Applied Biosystems) 
and loaded on a 3130 capillary sequencer. Sequences were edited 
using MEGA5 (Tamura et al 2011) and validated when identical 
copies were found in at least two clones. 

Validated sequences were checked for similarity with previously 
described sequences of SRK and S&fC- related genes from other Bras- 
sicaceae species using blast searches against the nucleotide database in 
Genbank, and against the genome of A. lyrata at the Phytozome 
website (http://www.phytozome.net/). In all positive clones sequenced 
in a first set of 16 individuals (135 clones), sequences from one type 
(BneARK3L) were found in all individuals (60% of all clones). This 
sequence is closely related to that of ARK3, a paralog of SRK (also 
called Aly8 in A. lyrata, and corresponding to NCBI gene ID 946052), 
which is not involved in pollen-pistil recognition (Charlesworth et al 
2003; Kusaba et al 2001). We therefore developed a method to mini- 
mize amplification of this gene before the cloning step intended to 
obtain SRK sequences. We identified a restriction site that was present 
only in this sequence and digested the DNA (purified PCR product) 



with the HmdIII restriction enzyme (A-AGCTT). The restriction was 
carried out overnight at 37° in a mixture (20 jjlL) containing 15 juuL of 
fresh purified PCR product, 0.2 mM Spermidin Sigma, 2 uX of IX 
Fermentas R buffer, and 0.05 U/uX Hindlll Fermentas restriction 
enzyme. Restriction products were mixed with loading dye and run 
at 1 10 V on 2% agarose gels in TBE buffer for 45 min. Fragments were 
visualized by ethidium bromide under ultraviolet light and compared 
with a 100-bp DNA ladder. Digested PCR products of BneARK3L 
gave two fragments (150 and 450 pb size). Undigested PCR fragments 
(^600 pb size) were extracted from the agarose gel, cloned, and 
sequenced as described previously. Of the 269 positive clones ob- 
tained, only 6% contained BneARK3L y but 37% contained another 
sequence type (Bne327420L) that is similar to an A. lyrata gene with 
unknown function (Hu et al 2011). We designed specific primers for 
Bne327420L to test whether it segregates at the S-locus and it proved 
to be unlinked (Table S2). The reaction mixture for PCR amplification 
(15 jjlL) contained 20 ng of DNA, IX buffer (Applied Biosystems), 
2.5mM MgCl 2 , 200 |jlM Fermentas dNTP mix, 200 |jLg/mL bovine 
serum albumin, 0.5 mm of each primer, and 0.025 U/uX Taq poly- 
merase (Applied Biosystems). Amplifications were performed on Eppen- 
dorf thermocycler with the following conditions: 15 min at 95°, 35 cycles 
of three steps: (1) 40 sec at 95°, (2) 40 sec at 60° and (3) 40 sec at 72°, 
and finally 10 min at 72°. PCR products were visualized as described 
previously. 

We performed a phylogenetic analysis of all our B. neustriaca 
sequences, together with a sample of SRK alleles and S&fC-related genes 
from Arabidopsis and Brassica species (Table SI). We used PhyML 3.0 
(Guindon et al 2010) on amino acid sequences with the LG model of 
substitution and 100 bootstraps. We identified several clades of SRK- 
related sequences from Biscutella by characterizing the largest mono- 
phyletic groups containing exclusively sequences from B. neustriaca. We 
estimated synonymous (tt s ) and nonsynonymous (7r N ) nucleotide di- 
vergence between sequences identified as belonging to the different 
clades, using DNAsp v.5 (Librado and Rozas 2009). For putative SRK 
alleles, we checked for signatures of positive selection at the codon level 
using CODEML from the PAML package (Yang 2007) by comparing 
the likelihood of the nested models M7 and M8, which differ by the 
existence in M8 of a category of codon sites under diversifying selection 
(go = dN/dS > 1). Putatively selected codons were determined by the 
Bayes empirical Bayes inference procedure (Yang et al 2005). Based on 
annotation of hypervariable regions previously identified in Brassica 
SRK alleles (Kusaba et al 1997) , we determined whether putatively 
selected codons were located within hypervariable regions. 

Based on the Si^C-like sequences, we developed a typing strategy by 
designing primers with Primer3 (http://primer3.sourceforge.net/) to spe- 
cifically target each candidate SRK sequence identified in collection F0 
(Table S2). PCR amplifications were performed as described previously 
for Bne327420L, and individuals were genotyped by absence/presence of 
a band on agarose gel. Because of the low divergence between some 
sequences, some nonspecific amplifications occurred (revealed by fre- 
quent or systematic amplification in all genotyped individuals, regardless 
of their incompatibility type). In these cases, PCR products were sys- 
tematically sequenced, as described previously, using the primers used 
for the amplifications. 

Segregation of SRK-like sequences and association with 
the incompatibility phenotype 

Our second aim was to verify that SRfC-like sequences are indeed allelic 
and that they segregate with the SI phenotype. For this we used segre- 
gation patterns in Fl offspring to test for associations between the SI 
phenotype and the occurrence of particular SRfC-like sequences in the 



G3'Genes | Genomes | Genetics 



Volume 4 June 2014 I Self-Incompatibility in Biscutelleae I 985 



FO collection. We also tested whether dominance occurs between the 
B. neustriaca S-haplotypes, as we previously inferred (Leducq et al. 
2010). For this purpose, we used two collections of seedlings obtained 
from 13 individuals of the FO collection (Table S3), representing the 
eight previously identified incompatibility groups (Leducq et al. 2010). 
The first collection of 82 seedlings (Flo) was obtained from open 
pollinations among seven FO individuals (four pollen donors and three 
pollen receptors) and paternities were assigned a posteriori with parent- 
age analysis (Leducq et al. 2010). This collection included eight groups 
of verified full sibs. The second collection consisted of five sets of 138 
seedlings (Flc) obtained from controlled pollinations among five pairs 
of compatible F0 individuals. These controlled crosses were performed 
as described in Leducq et al. (2010). 

To test whether observed genotype frequencies at the S-locus fol- 
lowed Mendelian expectations in the 13 cohorts of Fl individuals, we 
denned n as cohort size (which range from 5 to 50 in collection Fl, see 
Table S3) and randomly generated 100,000 hypothetical cohorts of 
individuals for each n value. Each simulated set of progeny (which 
we denote as a 'cohort') was assumed to be produced by a cross between 
two diploid individuals A and B with known genotype at the S-locus (or 
binary combinations of four possible S-haplotypes Si, S2, S3, and S4). 
We performed the analysis for two cases, depending on the number of 
different genotypes at the S-locus observed in the cohort: (1) individuals 
A and B both heterozygous {e.g., genotypes S1S2 and S3S4); (2) parent 
A homozygous (S1S1) and parent B heterozygous (S3S4; considered 
only when only one or two genotypes were present in the sibship). Each 
sibship from a cross could thus include either (1) genotypes SlS3 y SlS4 y 
S2S3, and S2S4 with expected frequency of 0.25 each, or (2) genotypes 
S1S3 and S1S4 with expected frequency of 0.5 each. For each cohort size 
n and each possible genotype, we used the distribution of simulated 
genotype frequencies to determine whether observed genotype frequen- 
cies differed significantly from those expected. 

To test for associations between the putative S-haplotypes and the 
SI phenotypes in the offspring, we performed 1472 controlled polli- 
nations between individuals of collection F0, Flo, and Flc. A total of 
1229 pollinations were performed between individuals putatively shar- 
ing at least one S-haplotype, and the others were between individuals 
not thought to share S-haplotypes, to control for pollen viability and 
stigma receptivity. In collection F0, we had 18 different combinations 
between 13 putative S-haplotypes. We made crosses between different 
individuals with the same S-locus genotype for 21 out of 35 genotypes 
tested. Controlled pollinations and determination of compatibility 
were performed as described in Leducq et al. (2010). 

RESULTS 

Identification of SRK-like sequences in 6. neustriaca 

Overall, our approach yielded 29 distinct sequences in the F0 indi- 
viduals (accession numbers KF905295-KF905324; Table S4). These 
sequences clustered into five clades, defined as the largest monophy- 
letic groups containing exclusively sequences from B. neustriaca. Two 
clades were closer to sequences of SRK paralogs than to SRK alleles 
from Arabidopsis or Brassica (Figure 1A). One of these clades includes 
five sequences (BneARK3L) that are similar to the ARK3 gene sequence 
in B. rapa (gene Bra 013522) and A. lyrata (Aly8 or NCBI gene ID 
946052). A second clade of four sequences (Bne327420L) is similar to 
a SRK paralog present in the A. lyrata genome (gene 327420) that lack 
a kinase domain, unlike ARK3. Specific PCR primers for Bne327420L 
yielded amplification products in all individuals from collection F0 
(data not shown), confirming that these do not correspond to S-locus 
alleles. The 20 remaining sequences were similar to those from SRK 



alleles from several Arabidopsis species, and we will refer to those as 
S-RiC-like (SRKL) sequences (Figure 1A). B. neustriaca SRKL sequences 
formed three distinct clades, which we call classes A, B, and C. The 
seven class A sequences are similar to AlySRK20 y a functional SRK 
allele of A. lyrata (also called SRKb in Kusaba et al. 2001). There were 
12 class B sequences, which are closest to AthSRKQ one of the three 
alleles still segregating at the S-locus in A. thaliana (Shimizu et al. 
2004), and to AlySRK36, a functional allele in A. lyrata (Bechsgaard 
et al. 2006). The third clade, class C, is represented by a single sequence 
similar to AhSRK12 y a functional A. halleri SRK allele (Castric and 
Vekemans 2007). All three clades of SRfC-like sequences in B. neustriaca 
are distinct from those of Brassica (Figure IB). 

Segregation of SRfC-like sequences from FO, F1 c , and 
F1 0 collections 

Patterns or presence and absence results for the 20 SRKL sequences 
among the 21 individuals of collection F0 are presented in Table 1. 
Despite the fact that B. neustriaca is diploid (Leducq et al. 2013; 
Tremetsberger et al. 2002), we observed up to four SRKL sequences 
per individual, sometimes including A, B, and C classes. However, we 
observed consistent associations between certain sequences. Hereafter, 
we denote a putative SRK sequence in a simpler form, for example using 
A01 as the abbreviation for BneSRKL_A01 y where A refers to class A. In 
this notation, we found, for instance, that A01 is associated with either 
A02 or A03, and B01 was always associated with B13, B04 with COi, B10 
with Bll, and B09 with B12 (Table 1). Genotyping of the sibships 
confirmed these associations and provided two more associations 
(A06 with A07 and B17 with B18). These associations suggest linkage 
disequilibrium between eight pairs of sequences, forming the following 
haplotypes at a single locus: A01-A02, A01-A03, A06-A07, 
B01-B13, B04-C01, B10-B11, B09~B12 y and B17-B18 (Table 2). 
Most of the associations involve sequence pairs from the same class 
(A or B). However, one haplotype (B04-C01) did not. In addition, several 



Table 1 SRK-like sequences identified in the FO collection 



Individual 


Sequences Identified 


Putative Genotypes 


1 


B09, BW, B11 a , B12 


B09B12/ B10B11 


2 


B09, BIO, B11 a , B12 


B09B12 / B10B11 


3 


BW, B11 


B10B11 / B10B11 C 


4 


A01a, A03, BW, B11 


A01aA03 / B10B11 


5 


A01b, A03, B09, B12 


A01bA03 / B09B12 


6 


A01a, A02, A01a, A03 a 


A01aA02/A01aA03 


7 


AO 1a, A03, BOS 


A01aA03 / B06 


8 


A06, A07, B06 


A06A07 / B06 


9 


B06, B18 


B06/B17 b B18 


10 


B01, B06 


B01B13 b /B06 


11 


B01, B13, BOA, C07 


B01B13 / B04C01 


12 


A05, B01, B13 


A05/ B01B13 


13 


A05, B04, CO 7 


A05/B04C01 


14 


A05, B04, CO 7 


A05/ B04C01 


15 


A05, B05 


A05 / B05 


16 


A05, B05 


A05 / B05 


17 


BOA, CO 7 


B04C01 / B04C01 C 


18 


AO 7 a, A02, BOA, C01 a 


A01aA02/ B04C01 


19 


B03, B06 


B03 / B06 


20 


B03, B05 


B03 / B05 


21 


A04, B01, B13 


A04/ B01B13 



Putative genotypes are deduced from patterns of segregation in the F1c and 
F1o collections (see Table 3). 
k Typed but not sequenced. 

Supposed by association. 

Homozygote genotypes. 
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Table 2 Pattern of SR/C-like sequences segregation observed in 13 cohorts of seedlings issued from crosses between individuals of 
collection FO 



Deduced Linkage Groups 



Parents 6 Seedlings Genotype (Observed Frequency) 0 Parent 1 Parent 2 



Seedling Collection 
F1o 



n a 


1 


2 


1 


2 


3 


4 


1 


2 


1 


2 


7 


3 


6 


AO/AOz 


A07A03 


" 




B70 


B70 


AO 7 


AO 7 








B10B11 


B 7 0B 7 7 






B7 7 


B7 7 


A02 


A03 








(4) 


(3) 














13 


3 


7 


A0/A03 
B10B11 
(/) 


B06B10 
B11 
(6) 






B70 
B77 


B70 
B77 


AO 7 
A03 


B06 


5 


8 


4 


A07A03 


A06A07 


A01A03 


- 


A06 


B06 


AO 7 


B70 








A06A07 


B10B11 


B06 




A07 




A03 


B77 








(2) 


(1) 


(2) 












7 


8 


6 


A07A03 


A07A02 


A01A03 


- 


A06 


B06 


AO 7 


AO 7 








A06A07 


B06 


BOS 




A07 




A02 


A03 








(1) 


(3) 


(3) 












21 


9 


4 


A f~\ A A AO 

A01A03 




A A 'f A A O 

A07A03 


A A / A A A 

B06B10 


BOS 




AO 7 


B70 








(4) 


(1) 


B06 
(10) 


B1 7 
(o) 






A03 


B7 7 


7 


9 


6 


A01A02 
(3) 


A A A AO 

A07A03 
(2) 


A A A A A O 

A07A02 
B06 
(2) 


— 


BOS 




AO 7 
A02 


AO 7 
A03 


6 


10 


4 


A07A03 
B0 7 
(1) 


B07B70 
B77 
(4) 


A07A03 
B06 

(1) 


- 


B01 d 


B06 


AO 7 
A03 


B70 
B77 


16 


10 


6 


A f~\ A A oo 

A01A02 


A f~\ A A OO 

A07A03 


A A <*f A A O 

A01A02 


A A *■! A A O 

A01A03 


B07 d 


B06 


AO 7 


AO 7 








B0 7 


B0 7 


BOS 


BOS 






A02 


A03 








(2) 


(6) 


(6) 


(2) 










14 


4 


20 


A /I A A /I O 

A07A03 


a r\ A A AO 

A01A03 


A A O A 'f A 

B03B70 


A A l~ A <*f A 

B05B70 


AO 7 


B70 


B03 


B05 








B03 


B05 


B7 7 


B7 7 


A03 


B7 7 












(3) 


(2) 


(4) 


(5) 










30 


12 


8 


A05A06 
A07 
(/) 


A05B06 
(6) 


A0SA07 
B0 7 
(6) 


B07B06 
(11) 


A05 


B0 7 


A06 
A07 
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Collection F1o (eight cohorts) was obtained from open pollinations and seedlings were assigned by paternity analyses (Leducq et a/. 2010). Collection F1c was 
obtained by controlled crosses. For each cohort, the following information is shown: number of seedlings, name of F0 parental individuals, SR/C-like sequences found 
in parents, associations of SR/C-like sequences found in progeny with frequency of association and deduced SR/C-like linkage groups in parents 
k Cohort size. 

F0 collection; see Table 2 for genotypes. 

F1 collection. 

B13, B17, and B18 could not be typed in progeny. 



sequences identified in the F0 collection, A04, A05, B03, B05, and B06, 
were not transmitted in pair. Using information about the eight hap- 
lotypes described previously, plus the five distinct haplotypes involving 
the five sequences just mentioned, allowed us to infer the genotype of 
each of the 21 F0 individuals (Table 1, Table 2, and Table 3). 

These results indicate that B. neustriaca has two SRKL genes in 
most of its S-locus haplotypes, and that most individuals we analyzed 
are heterozygotes. In two individuals (3 and 17) we found only one 
haplotype (haplotypes B10—B11 and B04—C01, respectively), so we 
hypothesize that these individuals could either be homozygotes for 
those haplotypes or heterozygotes with another haplotype, so far un- 
detected (Table 1 and Table 2). 



We tested the 13 putative S-haplotypes inferred from SRKL sequen- 
ces in sibships Flc and Flo. In eight of ten sibships the genotype fre- 
quencies followed Mendelian expectations from crossing two individuals 
heterozygous at the S-locus (case 1), but two Flo sibships had significant 
deviations frequencies (one for one genotype, and one for two). In three 
other sibships from crossing one heterozygous individual and individuals 
3 or 17, for which only a single haplotype was identified, genotype 
frequencies of 0.5 were found (case 2) and the alternative of more 
than two progeny genotypes was rejected (Table S3). This confirms 
that these two individuals are homozygotes for their respective 
haplotypes. Thus, all S-locus genotypes were inferred in the F0 
collection. 
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Table 3 Occurrence of putative S-haplotypes identified in FO collection (sequencing) 



S-Haplotype 


Seq 

i 


uence 

o 


Incompatibility Group 


Haplotype Co-occurrence in FO 


Cohorts With the Haplotype 3 


SO 7 


BW 


B11 


I 


5 


7 


S02 


A01 


A03 


II 


4 


9 


S03 


BOS 


- 


III 


5 


9 


S04 


B01 


B13 b 


IV 


4 


2 b 


S05 


A05 


- 


V 


5 


1 


S06 


B04 


CO 7 


VI 


6 


2 


S07 


A01 


A02 


VII 


2 


4 


cno 


DUO 




VIII 


Z 


1 


S09 


A06 


A07 




1 


4 


S10 


B05 






3 


1 


S11 


B09 


B12 




3 


1 


S12 


B17 b 


B18 b 




1 


0 b 


S13 


A04 






1 


1 



Incompatibility groups previously identified in FO collection by Leducq et a/. (2010) are indicated with associated S-haplotypes. 
Number of cohorts where segregation could be tested; see Table 2 and Table S4. 
B13, B17, and B18 were not typed in the progeny. 



Associations between SRK-like sequences and 
SI phenotypes 

As shown in Figure 3, the eight incompatibility groups (groups I— VIII) 
identified from S phenotypes in the FO collection (Leducq et al. 2010) 
correspond to groups of individuals sharing at least one SRKL sequence 
(putative SRK allele) and we will now refer to these as S-haplotypes SOI 
to S08 (Table 3). 

To confirm the link between these haplotypes and the SI phenotypes 
in the offspring and to validate it for other putative S-haplotypes, we 
performed 1472 additional controlled pollinations between individuals 
of collection Fl (Figure SI, Figure S2, Figure S3, Figure S4, Figure S5, 
Figure S6, Figure S7, Figure S8, Figure S9, Figure S10, Figure SI 1, Figure 
SI 2, Figure SI 3, and Figure SI 4). Figure 4 summarizes our results from 
all crosses between individuals of collections FO and FL A total of 71% 
of positive controls (243 crosses between individuals sharing no putative 
S-haplotype) yielded fruits vs. 9% of crosses between individuals sharing 
one putative S-haplotype (1229 crosses; Figure 4, A— D). These results 
confirmed the functional incompatibility types of the S-haplotypes pre- 
viously identified as S01 — S08 (Figure SI, Figure S2, Figure S3, Figure 
S4, Figure S5, Figure S6, Figure S7, and Figure S8) and validate the 
function of four other putative S-haplotypes S09 (A06/A07; Figure S9), 
S10 (B05; Figure S10), S12 (B17/B18; Figure S12) and S13 (A04; Figure 
S13), but not Sll (B09/B12; Figure Sll). Successful positive controls 
between individuals sharing no common sequence attested that these 
aborted pollinations most probably resulted from incompatibility reac- 
tions (Figure 4, A and B). Compatibility results were almost the same 



for different plants with identical genotypes (but see Figure S10 for an 
exception). Most crosses between individuals carrying S07 (A01/A02) 
and S02 (A01IA03) were fully compatible, indicating that, despite shar- 
ing the A01 sequence, these two haplotypes confer distinct incompati- 
bility types (Figure SI 4). The low sequence diversity we found within 
A01 sequences (one nonsynonymous nucleotide substitution separating 
sequences AO la and AO lb) was not associated with whether the other 
sequence in the haplotype was A02 or A03 (Table 1 and Table S4). 

Patterns of dominance among S-haplotypes 

In controlled pollinations among FO individuals, some pairs sharing one 
S-haplotype were fully compatible, indicating the occurrence of domi- 
nance relationships between S-haplotypes, as typical for sporophytic SI 
(Figure 3). Dominance relationships were not identical between pollen 
and stigmas. For instance, pollen from individual 4, carrying alleles SOI 
and S02 y was compatible with individual 3, carrying two copies of allele 
SOI (whose expression in stigmas was validated by unsuccessful pollina- 
tion by other individuals having SOI), but these individuals were incom- 
patible when individual 3 was the pollen donor (Figure 3 and Figure 4, 
A— D). Thus, allele SOI is expressed in stigma of individual 4 (and is thus 
either dominant or codominant with allele S02) but recessive to S02 in 
pollen. The varied compatibility in pollen and stigmas among the 
S-haplotypes in crosses between individuals sharing one S-haplotype 
(combining FO and Fl crosses) is shown in Figure 4, A— B. The average 
proportion of compatible crosses was lower for S-haplotypes of class 
BneSRK-A (3%) than for BneSRK-B (23%), and the difference was less 



Table 4 Analyses of nucleotide polymorphism and detection of positive selection in sequences within phylogenetic groups 
BneSRKL_A and BneSRKL_B 



Log Likelihood Positively Selected Sites 





^seq 


n sites 


7T S 


7T N 




M7 


M8 


-2ALn 


p-Value 


Sites 3 


Average dN/dS 


BneSRKL_A 


7 


528 


0.0993 


0.0777 


0.78 


-1432 


-1429 


6.37 


0.041 


262N 295S b 


4.110 


BneSRKL_B 


10 


465 


0.1705 


0.0984 


0.58 


-1714 


-1708 


11.71 


0.003 


274D b < c 


2.915 



276 b < d 
303V b < c 

Abbreviations: n seq , number of sequences analyzed; nsites, number of nucleotide sites; tt s and tt s , nucleotide diversity at synonymous and non-synonymomus sites, 
respectively. 

B0SRK6O used as reference. 

Sites in hypervariable regions. 
^ Sites also detected in Arabidopsis and Brassica. 

Site also detected in Arabidopsis. 
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Position (amino acids) 

Figure 2 Amino acid variation in two SRK-related sequence groups found in Biscutelia neustriaca. Height of bars is proportional to the number of 
different amino-acids at each position within a group. Relative frequency of each amino acid (symbolized by its letter) is proportional to its height 
in the bar. Black arrows indicate positions under positive selection in groups A and B (see Table 4) that fall into or close to hypervariable regions 
known to be involved in SRK-SCR protein recognition (HV2 and HV3 indicated by white frames). The alignment (100 amino-acids positions) was 
based on eight BneSRKL-A and 11 BneSRKL-B sequences in MEGA5. 



pronounced in stigmas (3 and 18% for A and B classes, respectively; 
Figure 4A) than in pollen (3 and 28% for A and B classes, respectively; 
Figure 4B). In stigmas, S-haplotypes of class A were expressed in all 
genotypes, whereas there was at least one case of recessivity for a class B 
haplotype (Figure 4C). We observed many more cases of recessivity in 



pollen, at least 10 cases, mostly involving two class B S-haplotypes 
(SOI and S06) y which were not expressed in most genotypes tested 
(Figure 4D). Overall, codominance was more frequent in stigmas 
than in pollen and recessive haplotypes generally belonged to class 
B (Figure 4E). 
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Figure 3 Results from controlled crosses realized among 21 individuals of collection F0 (Leducq etal. 2010) related with S-genotypes determined 
in this study. Linked groups of SR/C-like sequences found in each individual are indicated in second and third columns; corresponding S- 
haplotypes are indicated in fourth and fifth columns. Black boxes represent unsuccessful self-pollinations; gray boxes represent reciprocal un- 
successful cross-pollinations, filled with the expressed S-haplotype when shared between crossed individuals (sign equal when to crossed 
individuals have identical genotypes). Light gray box represent nonreciprocal unsuccessful cross-pollinations. Successful cross-pollinations are 
represented by plus signs (+). Unsuccessful cross pollinations realized between individuals sharing no S-haplotypes are indicated by minus signs 
(-). Unrealized pollinations are represented by crosses (x). 
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Figure 4 Dominance relationships determined by controlled pollinations between 13 S-haplotypes in B. neustriaca. Proportion of successful 
pollinations calculated for all crosses involving stigma (A) or pollen (B) of an individual with a given S-haplotype (x-axis). Crosses between individuals 
sharing or not sharing the haplotype are filled in black and gray, respectively. Pattern of SI expression for S-haplotypes involved in 35 possible 
genotypes in stigma (C) and pollen (D). Results are indicated for the haplotype X (in rows) as compared to haplotype Y (in columns) and report 
whether X was expressed (+) or not (R). Uncertain cases resulted from either the lack of evidence for X function (R?) or from inconsistencies between 
different individuals with the same genotype (+/R). (E) Dominance relationships among 12 S-haplotypes (S11 removed) in stigma (left) and pollen 
(right). Only nonambiguous cases were reported here. Dotted frames regroup haplotypes that are mostly codominant. Arrows indicate dominance 
relationships. Haplotypes from classes A and B are filed in gray and white respectively. See Figure 3, Figure S1, Figure S2, Figure S3, Figure S4, 
Figure S5, Figure S6, Figure S7, Figure S8, Figure S9, Figure S10, Figure S11, Figure S12, Figure S13, and Figure S14 for detailed crosses. 



Analysis of polymorphism among SRKL sequences and 
test of diversifying selection 

To test for positive selection resulting from negative frequency- 
dependent selection, we analyzed polymorphisms in the SRKL sequences. 
Diversity was found to be high among sequences of BneSRKL classes 
A and B at both synonymous (tt s = 9.9-17.0%) and non-synonymous 
sites (tt n = 7.8-9.8%; 528-465 nucleotide sites; Table 4, Figure 2); 



class C has a single sequence. PAML analysis yielded evidence for 
a class of sites evolving under diversifying selection for both classes 
A and B (Table 4). Using a cut-off at P = 0.95 in the Bayes empirical 
Bayes analysis, we detected 2 codons evolving under positive selec- 
tion for class A and 3 codons for class B (Table 4 and Figure 2). All 
three positively selected codons of class B were within regions cor- 
responding to the hypervariable regions of Brassica sequences 
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(Nishio and Kusaba 2000), and the same was true for one of the two 
class A selected codons (Figure 2). Three of these sites were 
also previously detected as positively selected in other Brassicaceae 
(Table 4). The posterior mean dN/dS for positively selected sites in 
classes A and B were 4.110 and 2.915, respectively (Table 4). Only 
one sequence, BneSRKL_B12, had a premature stop codon; this se- 
quence has a lbp deletion that resulted in a reading frame shift. 

DISCUSSION 

Molecular identification of SRK-like sequences linked to 
the SI phenotype in the tribe Biscutelleae 

Here, we identified &RiC-like sequences in B. neustriaca and demon- 
strated that these sequences segregate at a single locus and are tightly 
linked with the SI phenotype. Our results suggest that these SRfC-like 
sequences are functional SRK alleles similar to those in the Brassica- 
ceae SSI system previously described in Brassica (Kusaba et al. 1997; 
Sato et al. 2002), Capsella (Guo et al. 2009; Paetsch et al 2006), 
Arabidopsis (Castric and Vekemans 2007; Kusaba et al 2001; 
Schierup et al 2001), and Arabis (Tedder et al 2011) and not those 
of Leavenworthia, where the gene involved in stigma SI is not homol- 
ogous to the SRK of Arabidopsis and Brassica (Chantha et al 2013). 

SRK sequences in the Biscutella and Brassica lineages 
have several features in common 

The two large monophyletic groups, A and B, of B. neustriaca 
SRKL sequences resemble the situation observed in Brassica out- 
lined in the Introduction, where the two sequence groups represent 
dominant and recessive alleles. A demographic event such as 
a strong bottleneck may have caused the low SRK diversity in 
Brassica (Castric and Vekemans 2007; Edh et al 2009). The B. 
neustriaca clusters, including the single sequence of group C, do 
not cluster with those of Brassica. Moreover, the reduction in the 
number of lineages in B. neustriaca appears to have been followed 
by allelic diversification of two of the three surviving SRK lineages, 
with evidence of strong positive selection (dN/dS in the range 
2.92—4.11) of the same order of magnitude as inferred in Brassica 
(2.98; Castric and Vekemans 2007) and in Leavenworthia (3.49; 
Herman et al 2012) and contrasting with lower estimates for A. 
lyrata and A. halleri (1.49 and 1.44, respectively) whose S-locus 
diversification seems to be much more ancient. 

The duplicated SRKL sequences within B. neustriaca S-haplotypes 
also resemble the situation in Brassica, where SRK and SLG co- 
segregate at the S-locus (Nishio and Kusaba 2000; Takuno et al 
2008). Cosegregation of Si-related sequences also was recently ob- 
served in some genera of Papaveraceae, which have a gametophytic 
system (Paape et al 2011), and in Solaneaceae a set of genetically 
linked F-box genes collectively determines the pollen phenotype 
(Kubo et al 2010). We cannot currently determine whether the 
two B. neustriaca SRKL sequences together determine the pistil phe- 
notype, or, a single SRK sequence, as in Brassica. In one case, how- 
ever, we have evidence that a sequence is probably not involved in 
pistil recognition, since BneSRKL-AOl is shared by two functionally 
distinct S-haplotypes, S02 and S07. Hence, further functional inves- 
tigations should focus on the other BneSRKL sequences, A02 and A03 y 
also carried by these haplotypes. Genomic studies are now needed to 
determine the structure and genomic localization of the S-locus, and 
to obtain the full coding sequences of both &RiC-like copies within 
S-haplotypes, to determine whether both sequences have a functional 
kinase domain, or whether one of them consistently lacks this like 
Brassica SLG. 
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Number of S-alleles and patterns of dominance 

B. neustriaca is a narrow endemic species whose populations are 
highly disconnected, with restricted gene flow (Leducq et al 2013). 
Within natural populations, individuals typically aggregate in high- 
density patches (J.-B. Leducq, personal observations). In our previous 
study, we showed that highly reduced diversity at the S-locus and low 
local density could greatly reduce maternal reproductive success 
(Leducq et al 2010), i.e., there is a S-Allee effect (Wagenius et al 
2007). Given that B. neustriaca has at least 13 functionally distinct 
S-alleles in the 21 individuals sampled in collection F0, it might appear 
unlikely that ecological events could reduce allelic diversity enough to 
cause an overall reduced reproductive success in natural populations. 

Recessivity of the SI phenotype in B. neustriaca is commoner in 
pollen than in stigmas, where codominance is most frequent, as was also 
found in Brassica and Arabidopsis. The difference between dominance in 
pollen and stigmas is often interpreted as a consequence of the more 
intense sexual selection on the male than on the female function, because 
dominance in pollen allows a plant to minimize the chances of its pollen 
being rejected by stigmas from other plants (Schoen and Busch 2009). 
Interestingly, all recessive S-haplotypes except for one appear to belong to 
class B, suggesting that different dominance is associated with different S- 
haplotype lineages, as observed in Brassica (Nasrallah et al. 1991) and 
Arabidopsis (Llaurens et al. 2008; Prigoda et al. 2005). 

Because negative frequency-dependent selection acting on S-allele 
diversity is expected to be partially relaxed for recessive S-haplotypes, 
they are expected to reach greater frequencies in populations than 
dominant ones (Billiard et al. 2007; Llaurens et al. 2008; Schierup 
et al. 1997). Consistent with this prediction, the two most recessive 
S-haplotypes found in our sample of 21 individuals (S01 and S06) 
were the most frequent. It should now be tested whether this is also 
true in natural populations of B. neustriaca. 
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